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5 BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates generally to information retrieval from the Internet or 
other search spaces, and more particularly to a system of accumulating and organizing 
information collected from multiple remote locations in a central database, and implementing 
10 collected information in an open recommendation system. 

2. Description of the Related Art 

Presently, the nature and the structure of the Internet and other vast search spaces, 
combined with current technology and Internet exploration methodologies, present users with 
many problems for which current technology has offered no solutions. For instance, while an 
15 overwhelming amount of information is obtainable on the Internet, the available information 
content is limited only to materials accessible through Hypertext Markup Language (HTML). 
Furthermore, present methods of organizing and presenting the information which is 
available to users are inadequate. 

While some sites on the World Wide Web, for example, may provide users limited 
20 access to trends or suggestions based upon other users' behavior at that particular site, there 



is currently no expansive system which provides one user with global access to feedback and 
other information offered by other users. Moreover, since the current state of the art neither 
recognizes nor accommodates personal preferences or biases, exploration of the search space 
is less interactive, and consequently less efficient and less productive, than it otherwise could 
5 be. 

In the context of the World Wide Web, for example, the information presently 
available for users is limited to the HTML documents which make up the searchable hyper- 
space. While most users consider interaction with other users and recommendations by other 
users to comprise the most efficient mechanisms for identifying useful and relevant 
10 information, current systems employed for information organization and retrieval are not 
adapted for providing a centralized server through which convenient and unrestricted user 
access to input from other users is enabled. 

Additionally, while many users turn to the Internet for information concerning other 
media types, for example, television and radio, these and other media which are not HTML 
15 based are completely neglected by the Internet navigation and information retrieval systems 
currently in use; that is, access to these various other media is not possible through current 
HTML navigation methods and systems. This inaccessibility or elimination of other media 
sources which are not based upon HTML documents from current search and navigation 
systems represents a significant limitation of the state of the art. 

20 Moreover, where limited interactive functionality is provided by current technology, 

the interaction is typically restricted to information exchange between one user and the site 
host, i.e. a closed system, rather than expanded to include information exchange between the 



various users of the site in an open recommendation system. As a result, current systems 
provide search results which do not take into account the preferences of users of such a 
closed system. 

There has been a continuing and growing need, therefore, for an open, or distributed, 
5 recommendation system and method for identifying and accessing the most relevant 
information from the Internet or other vast search spaces. Such an open recommendation 
system and method should be customized so as to take into account the preferences and 
biases of the individual navigating the search space. Furthermore, a truly integrated system 
should support access through HTML to a variety of other media which are not traditionally 
10 HTML based, such as television and radio, for example. Additionally, since human input is 
generally the most preferred source of information, an open recommendation system and 
method should be adapted to incorporate human input as data into a central database, to 
facilitate retrieval of that information, and to encourage human interaction. 

SUMMARY OF THE INVENTION 

15 The system and method of the present invention overcome the foregoing and other 

shortcomings of known methodologies by employing open recommendation technologies 
which utilize a distributed monitoring system to accumulate a vast open database of 
knowledge and information which incorporates human input as data. In the terminology of 
the invention, this open database is referred to in some places as an open knowledge base. 

20 The present invention operates to collect both explicit data and implicit data using a 

de-centralized, or distributed, monitoring system which provides for data collection across a 
broad range of remote sources. Software may be installed at remote terminals for monitoring 



purposes; as a result, data may be monitored locally at the client side, and subsequently 
transmitted to a central database. Additionally, the system and method of the present 
invention provide for aggregation, at the server, of the data collected on the client side from 
multiple remote sources. During the aggregation process, data collected by the distributed 
5 monitoring system are categorized and organized in a central database for convenient 
retrieval. 

Data may be collected on the client side as explicit data, which may be input directly 
by a user in the form of recommendation, comment, or vote, or as implicit data, which may 
be collected by the system according to the user's browsing activity; further, the system may 

10 collect both explicit data and implicit data concurrently or in combination. Implementation 
of the collected data includes both transmitting explicit data on demand as well as utilizing 
explicit data, implicit data, or a combination of both explicit and implicit data, in an open 
recommendation system which facilitates customization and personalization of the 
information retrieval process. According to one aspect of the present invention, a user may 

15 be provided with the option of turning off, or "deselecting," the implicit data collection 
functionality of the system. 

The system and method of the present invention operate to gather and to collect 
human knowledge by monitoring users' activities on the client or browser side, rather than on 
the server side. Data collected on the client side relating to user behavior and preference may 
20 subsequently be mined, converted, and manipulated through statistical analysis at the server. 
Such data collected may include some or all of the following: the Universal Resource 
Locators (URLs) associated with the sites visited; duration of time spent at each site; 



keywords the user associates with each site; explicit comments input by the user relating to 
each site; user evaluation of each site via voting; acceptance or rejection of recommendations 
related to URLs or people; other users whose name cards were accessed and/or collected; the 
people whom the user contacted; and user H). 

5 Additional information may be obtained from the sites and URLs themselves, and 

other multimedia data may be obtained such as information related to television and radio 
broadcasts, people, MP3, and telephone numbers. As a user visits each site, relevant 
information is collected. The information collected may be categorized as either explicit or 
implicit; both types of data may be collected simultaneously. Implicit information includes 
10 the names of Web sites, in terms of URLs, while explicit information includes vote 
responses, comments, and preferences with respect to each Web site. 

In contrast to the various technologies currently in use, the system and method 
embodied in the present invention may be characterized as distributed, or "open," since 
neither the nature nor the location of information is a barrier to data collection, aggregation, 

15 and subsequent implementation. As an example, the system and method described herein do 
not depend for their utility upon portal-type technologies which favor information offered by 
licensed content providers. Accordingly, the inventive system and method are adapted to 
collect and to aggregate data with respect to the entire search space; that is, since data is 
monitored at the client side and stored at a central server, the information available to be 

20 collected and utilized is not selectively limited by the operator of a portal site or other 
information content provider, but rather extends to any and every site which may be 
reachable from the client side. 



Additionally, monitoring may not be limited to browser or other Internet tool 
technology, but rather may extend to other types of terminals such as radios, televisions, 
telephones and Personal Communication Systems (PCSs), Personal Digital Assistants 
(PDAs), video recorders, set top decoder boxes, and the like. These various other terminals 
5 may be suitably connected, such as via wire line or wireless technology which is known in 
the art, to an Internet capable terminal such as a personal computer equipped with a browser. 
Through appropriate hardware connections and software communications, the system may be 
adapted to monitor activity conducted on these other terminal types. 

In this regard, the system and method of the present invention are adapted to integrate 
10 the HTML platform with other media, such as television, radio, newspaper, and people. 
Presently, HTML is the language of choice for Web design technology; consequently, users' 
access to information content is limited by Web browser convention, i.e. restricted to HTML 
documents or pages. While current browser software allows display of information in 
various media accessible through HTML, it is not possible to provide users with access to 
15 other media. 

Conversely, the present invention not only provides access to other media which 
cannot be displayed by HTML, such as real television or radio broadcasting, for example, but 
also maintains database records of information, monitored at the client side, concerning these 
alternative media types; in this manner, multimedia data are searchable in the same manner 
20 as ordinary HTML information. Current technology allows a user to search for Web sites or 
URLs focused upon a particular topic, for example; in response, the search engine provides a 
list of URLs which are related to the topic of interest. The present invention, in addition to 



returning a list of relevant URLs or Web sites, is further adapted to provide a list of people, 
television programs or channels, radio broadcasting information, and the like, which is also 
related to the topic of interest. 

Hyperlinks displayed on the HTML page may link the user directly to a particular 
5 television or radio program or other live broadcast, which may be presented to the user at a 
different terminal or device. Selecting such a link displayed on the HTML page activates 
software programming designed to interact with the other media beyond the scope of HTML. 
As noted above, the software code of the present invention, in conjunction with appropriate 
hardware connections, may control the content displayed or accessed through other types of 
10 terminals such as televisions, video recorders, and radios, for example. 

The present invention utilizes browser-embedded software to enable client side 
monitoring of user activity. The embedded software may be adapted to provide such features 
as search engines, advertising, surveys, slot machines, and lotteries. Conventional browser 
software has an integrated feature which allows incorporation of the system's software. As 
15 noted above, the embedded software program code may provide the following: monitoring of 
user activity on the client side; search engine functionality; banner advertising which may be 
customized for user preferences; survey or voting functions; and lotteries. 

Since the foregoing features are incorporated into the software code installed on a 
user's terminal at the client side, the features are operational irrespective of which URL is 
20 currently loaded into the user's browser. That is, the user need not visit a particular Web site, 
such as a search engine Web site, for example, in order to take advantage of the search 
engine feature embedded in the browser software. Since the search engine function is 



included in the browser embedded software which accesses the database maintained at the 
server, a user need not navigate to a traditional search engine site before initiating a search. 

Enhanced banner advertising functionality is also provided. Conventional banner 
advertising methodologies, for example, are dependent upon data monitored on the server 
5 side; consequently, these systems can only customize advertising presented to a user while 
the user is visiting the particular site which performs the server side monitoring. By 
integrating the banner advertising feature into the browser embedded software, and through 
utilization of the data monitored on the client side and stored at the central server, the 
inventive system can present customized banner advertising to a user irrespective of the 
10 user's location in the search space. 

Additionally, the browser embedded software is capable of communicating with 
various multimedia terminals, as discussed briefly above. Through software control of 
network hardware devices, the system software embedded in the browser enables data 
monitoring and collection from non-HTML based sources, as well as control of various 
15 multimedia terminals. 

The present invention additionally provides an advanced search engine ranking and 
filtering method which takes advantage of monitored data and user voting history. The 
conventional search engine orders search results in accordance with one or more 
predetermined algorithms, for example, which may be based upon metatags or other 
20 metadata, the number of occurrences of a queried keyword, the location of that keyword 
within a document, or some combination of these factors. Consequently, the typical search 
engine provides the same results in the same order for every search based upon a given 



keyword; these results are based upon neither a measure of the real relevance of the various 
sites with respect to a given keyword nor users' evaluations of the information provided at 
those sites. 

While there have been many attempts to provide better search results by improving 
5 existing search engines, there have been very few successful efforts. The inventive system 
and method enable more meaningful search results by utilizing implicit and explicit data 
recorded in the central database to refine searches based upon what is known about each site 
which may potentially be returned as a result of any given search. Access history and voting 
records, collected from all the users of the system, are utilized to evaluate and to rank 
10 potential search results. The results returned are more meaningful as a result. As the 
database grows, search results for the same query improve over time. 

For example, the list of URLs returned by current search engine technologies often 
include dead links which are no longer operational or have not been updated for a very long 
time. The inventive strategy of using explicit and implicit monitored data to evaluate each 
15 URL eliminates the likelihood of dead links being returned as search results, since users are 
unlikely to provide a vote of approval for such sites, and consequently, fewer users are likely 
to visit such a site or to spend any significant amount of time there. 

Whereas current search engines may erroneously assign a higher relative priority to 
links actually having less relevance with respect to a particular query, this limitation is 
20 overcome through examination of explicit data concerning each site; human input may be 
analyzed as a factor in the relevance determination, thereby making the assessment of 
relevance more reliable and the search results, in turn, more accurate. 



Additionally, a personal or customized search engine may be based upon privately 
monitored data concerning each respective user of the system. This search functionality is 
similar to that discussed above; in accordance with this particular aspect of the present 
invention, however, individual data is applied for each user conducting a search. The system 
5 can, therefore, provide customized search results by utilizing user profile information 
collected for each respective individual. Accordingly, the search results for each individual 
user may be different, even if two users conduct a search using the same keyword, since each 
user profile is unique. Where a first user, interested in Toyota automobiles, for example, and 
a second user, favoring, say, Honda automobiles, each performs a search using the keyword 
10 "automobile," the system of the present invention ranks Toyota related sites higher for the 
first user and Honda related sites higher for the second user. 

Such a search strategy reflects each person's profile data, and may be adapted to 
provide the search results in order of decreasing relevance with respect to each searching 
user's unique profile data. Once again, the more user information is available, the better the 
15 results over time for a given query. As the user's profile becomes more comprehensive, the 
system has more and better data which may be used to customize the search. 

In this manner, user preferences may be considered as a driving factor in a search 
process, as in the automobile example above. While many Web-based search engines of 
conventional design attempt to order, or to rank, a list of URLs according to relevance with 
20 respect to a query term, for instance, current technology does not account for the preferences 
and habits of the individual user. The system and method of the present invention, on the 
other hand, create a unique experience for each individual user by employing data 

-10- 



accumulated with respect to each particular user's preferences so as to rank search results in 
an order more customized to each user's personality and tastes. 

According to another aspect of the present invention, a multimedia search engine 
utilizes and accesses various kinds of information via user or human input. Accordingly, 
5 search results may include itemized lists or information concerning people, television, video, 
MP3, and others, in addition to ordinary URL results. Since human knowledge is 
incorporated into the system and catalogued in a central database, information outside the 
realm of HTML, such as television, radio, MP3, and so forth, may be included. The 
references to these other media may be expressed in the HTML, and links may be provided 
10 for access thereto. 

Through selection of a link representing a television program, for example, the 
corresponding television program may be displayed on a television connected to the system 
as discussed above. Further, if a user were to conduct a search using the title of a television 
program as a keyword query, for example, the search result may yield the names of people 
15 who share preferences and habits which are consistent with the theme of the television 
program. By following a link from one of the people returned as a search result, which link 
may be shown on the HTML page, the television program may be viewed on a television 
connected to the system. 

In accordance with this aspect of the invention, a given search result may return many 
20 links to multimedia content beyond HTML. From one or more such links, there may be 
multiple levels of links emanating therefrom. That is, rather than just returning a link to a 
television program, or to a particular user's relevant personal information, the search results 



may return additional link levels to other users, other types of programming (not only 
television, but also MP3, or radio, or the like), and additional link levels beyond that. The 
result will be a tree of links. Thus, for example, a link to a particular television program may 
yield a link to other types of multimedia content, including references to individuals, their 
5 preferences, and the like. These links in turn may link to other types of multimedia 
information beyond HTML. 

Another important aspect of the present invention involves an open site 
recommendation service. Explicit user input and implicit user data, derived from user 
behavior, may be monitored such that the preferences and biases of an individual user may be 
10 known and stored in a central database. These preferences and biases may subsequently be 
employed by the open recommendation system and method to personalize the process of 
information retrieval and to assist the user in making decisions concerning information which 
may be most desirable for a given user. 

Since a vast array of data is stored at the central database concerning HTML pages, 
15 for example, the system may employ program code adapted to recommend certain sites 
which are consistent with data in an individual user's profile. Where the system makes 
recommendations or suggestions based upon information known about the various sites 
recorded in the database, as well as upon explicit and implicit data concerning the user's 
preferences, information location and retrieval may be more efficient for the user. 

20 Importantly, the inventive recommendation functionality is enhanced by the client 

side monitoring aspect of the present invention; every Web site, URL, person, multimedia, or 
other type of information source which has been recorded in the central database may be 
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recommended by the system. Whereas typical portal sites, for example, only recommend 
information owned or licensed by the portal operator, the present invention is not so limited, 
and can recommend sites or other information sources from anywhere in the search space, 
regardless of the location or ownership of that information. As such, in accordance with this 
5 aspect of the invention provides the function of what the inventor has termed a "knowledge 
service provider," or KSP. The information that is accumulated from the client side, of 
course, constitutes knowledge; making that knowledge available to users in various ways, 
including but not limited to recognition of the preferences of a given user and the offering, to 
that user, of relevant information related to those user preferences. 

10 In accordance with still another aspect of the present invention, the system and 

method are adapted to offer recommended or suggested communication with particular 
people. Through efficient utilization of the implicit and explicit data collected for each user, 
the present invention allows one user to find other, compatible people to contact. This 
facilitates communication between users, and encourages information exchange on a large 

15 scale. 

Conventional chat rooms, newsgroups, or Bulletin Board Services (BBS), all of 
which facilitate running dialogue among users, are deficient with respect to users' 
preferences, or "profiles." Importantly, any user profiles which are created at such locations 
are exclusively local, and users are enabled to communicate only with other users of that 
20 local system, i.e. these systems are inherently closed or concentrated. Users employing 
conventional technology must first search to find an appropriate site or forum in which to 
communicate; only then can users attempt to find other people with whom to correspond. 
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The present invention, in contrast, may employ both explicit and implicit data in 
users' profiles to connect users with common interests, similar personalities, or mutual 
friends, without the limitations inherent in current, closed systems. A user need not search 
for a forum or other chat site, for example, since the system itself provides the mechanisms 

5 for communication; other users and their explicit data are readily provided by the system, 
along with appropriate user profile data to facilitate selection of a person having similar 
interests. Since implicit data concerning, for example, favorite URLs or most commonly 
searched keywords, are incorporated into a user's profile, other users may be assured that the 
system's suggestions or recommendations concerning compatible people are accurate. This 

10 is another aspect of the "knowledge service provider" (KSP) feature of the invention. 

Another feature of the invention follows on an earlier aspect of the invention, which 
collects good information content. According to this further aspect of the invention, a service 
is provided both to select and to recommend that good information content to users. This 
aspect of the invention removes the need to continue to collect necessary information 
15 content, and the need to keep the contents fresh. Those efforts are costly and time- 
consuming, and have not been terribly efficient. The just-mentioned needs are met by other 
aspects of the invention, and by Web site owners and Internet users, particularly those 
owners and users availing themselves of those other aspects of the invention. 

Importantly, the system and method of the present invention are adapted to provide 
20 matching degree meters, or compatibility gauges, which indicate the degree of similarity 
between the user and the page or URL currently loaded in the user's browser, for example, or 
between the user and other members or users of the system. A graphical display or 
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numerical indication may be provided to the user with respect to the degree of "matching" or 
similarity. Such degree may change as a result of various factors: for example, the content of 
a particular site may vary over time, or the information in the user's profile may change 
during ordinary and increasing use of the system. As noted above, matching information 
5 may include the matching degree between one user and other people making use of the 
system. Given an indication of the relative matching of information in various users' 
profiles, one user may select the link to the user having the most closely matched profile 
rather than selecting a different user whose preferences appear to be relatively incompatible. 

The above-mentioned and other attendant advantages of the present invention will 
10 become more apparent upon a detailed examination of the preferred embodiments thereof 
with reference to the attached drawings, described below. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a simplified diagrammatic view of one embodiment of a system 

constituted in accordance with the present invention. 

15 Figure 2 is a simplified diagrammatic view of one embodiment of a distributed 

monitoring process employed by the present invention. 

Figure 3 is a simplified diagrammatic view of one embodiment of a data aggregation 
process employed by the present invention. 

Figure 4 is a simplified diagrammatic view of one embodiment of an open 
20 recommendation system and method of the present invention. 
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Figure 5 is a general diagrammatic overview of the functionality of the system and 
method of the present invention. 

Figure 6 is a representation of a Graphical User Interface employed by standard Web 
browser software which has been modified in accordance with the present invention. 

5 Figure 7 is a representation of search results ordered in accordance with one aspect of 

the present invention. 

Figure 8 is a representation of search results ordered in accordance with another 
aspect of the present invention. 

Figure 9 is a simplified diagrammatic view of one embodiment of the distributed 
10 monitoring system of the present invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Turning now to the drawings, Figure 1 is a simplified diagrammatic view of one 

embodiment of a system 100 generally constituted in accordance with the present invention. 
A central server 120 is provided with program code, such as JAVA® code, for example, or 

15 other suitable programming, which may be accessible from any Internet capable machine. A 
user at a remote location may have access to the Internet, and therefore to the central server 
120, through a workstation, desktop, or portable notebook or laptop computer, for example, 
or through a wireless or handheld terminal, such as a portable Personal Communication 
System (PCS) or Personal Digital Assistant (PDA) device. Internet access may be through 

20 traditional land-line telephone connection, through higher bandwidth connections such as 
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ISDN, DSL, T-l, T-3, coaxial, or fiber optic cables, or through cellular or wireless digital 
technology, for example, as is known in the art. 

The program code at the central server 120 includes a central database for recording 
and categorizing information, and may be further adapted for communicating with additional 
5 program code 140 at a remote location, such as a user's computer terminal. The program 
code 140 may be transmitted or downloaded from the central server 120, for example, or 
installed onto the user's terminal from magnetic, optical, or other media. In the exemplary 
embodiment illustrated in Fig. 1, the program code 140 may interface with Internet 
navigation software, such as may be contained in information terminal 180; alternatively, the 
10 program code 140 may provide Internet navigation functionality. 

It will be appreciated by those of skill in the art that typical browser software 
contained in information terminal 180 is customarily adapted to be modified by additional 
software, such as program code 140, which may both alter the Graphical User Interface 
(GUI) presented to the user by the browser software as well as provide additional 
15 functionality. The program code 140 may incorporate the following functions: "browser 
side," or "client side," monitoring of user browsing activity; search engine functionality; 
banner advertising, which may be customized or personalized according to monitored user 
preferences; survey or voting functions; and lottery, slot machine, or other types of games of 
chance. 

20 The provision of advertising, or games, as part of the additional functions is well 

within the abilities of the ordinarily skilled artisan. The provision of advertising, per se, is 
known; however, it is within the contemplation of the invention to customize the advertising 
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based upon user or group profiles and preferences, utilizing known types of statistical or 
weighted functions. Importantly, since the advertising function is integrated into the program 
code 140 at the client side and does not depend for its utility upon server side monitoring at a 
particular Web site, for example, targeted or customized advertising is not restricted by the 
5 particular URL which is currently loaded in the browser software 1 80. 

Alteration of the GUI may be effectuated through insertion of a tool bar or other 
interactive interface, for example, which may include drop-down menus, operative buttons, 
and the like, through which the user may gain access to, and maintain control over, the 
capabilities of the system. In this regard, the program code 140 not only facilitates 
10 communication between the user's terminal and the central server 120, but also serves as the 
front end interface between a user and the system of the present invention. 

In operation, the program code 140 at the user's terminal may be adapted to accept 
data input directly from the user, and also may monitor the activity of the browser software 
so as to collect information concerning that user's browsing behavior, habits, preferences, 

15 biases, and so forth. For example, when the user's browser software connects with or 
navigates to a particular Web site, such as one which may be maintained at Web server 198, 
the program code 140 may transmit information concerning that interaction from the user's 
computer terminal to the central server 120. Exemplary data which may be monitored in this 
embodiment may include the Universal Resource Locator (URL) of the Web site visited by 

20 the information terminal 1 80 via the browser software; URL information may be parsed out 
of the Hypertext Markup Language (HTML) code for the Web site. Keywords or query 
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terms which the user submitted prior to navigating to the Web site, as well as the duration of 
the visit, may additionally be recorded. 

As indicated in Fig. 1, data are monitored or collected at the user's terminal, or at the 
browser side, or client side. This data collection may be through direct input from the user or 
5 through the interface between the program code 140 and the browser software in information 
terminal 180. The program code 140, then, may transmit collected data to the central server 
120 for recordation, categorization, and aggregation with data collected from other users. In 
the case where secure or private transmission is desired or required, transmission between the 
program code 140 at a remote terminal and the central server 120 may be through a firewall 
10 1 3 0 or other security mechanism. 

The information collected by the program code 140 at the remote terminal may 
generally be categorized as either implicit data or, alternatively, explicit data. In the case 
where the program code 140 is simply monitoring the user's browsing behavior as discussed 
above, for example by tracking URLs visited and the duration of each stay at a given site, the 

15 data collected are classified as implicit, since input is neither actively solicited from the user, 
nor received directly as a specific user action. A user may wish to deactivate the implicit 
data monitoring function. Explicit data, on the other hand, may be collected in the form of 
information input by the user through direct interaction with the program code 140. As an 
example, the user may select an operative button provided by the enhanced interface enabled 

20 by the program code 140, and such a selection may invoke a program script which allows the 
user to conduct a keyword search, for example, or to input written comments or opinions 
concerning the quality of the information provided at the site which the user is currently 



visiting. Alternatively or additionally, the program code 140 may enable the user to engage 
in ongoing dialogue with other users simultaneously visiting the site, or to post messages for 
other transient users. The user also may provide explicit data by casting a vote. 

Implicit and explicit data may be collected concurrently or simultaneously. In one 

5 embodiment, data collected may include some or all of the following: URLs visited; duration 
of time spent at each site; keywords the user associates with each URL or site; explicit 
comments input by the user relating to each site; user evaluation of each site via voting; 
acceptance or rejection of recommendations related to URLs or people; data concerning other 
users whose name cards were accessed and/or collected; the people whom the user contacted; 

10 and user ID. Additional information may be obtained from the sites and URLs themselves, 
and other multimedia related data may be obtained, such as information related to television 
and radio broadcasts, people, MP3, addresses, telephone numbers, and the like. Thus, 
information terminal 180 may interact with TV station 196, radio station 197, and MP3 
station 199, in addition to Web server 198 in known ways. For example, television signals, 

15 including the vertical blanking interval within such a signal, have sufficient capacity to 
contain information such as program title, program summary, identification of actors and 
actresses, and the like. Similar information (e.g. song titles, composers, performers, lyrics, 
and the like) may be contained in parts of radio signals or MP3 information. Such techniques 
are well within the abilities of the ordinarily skilled artisan, and so need not be detailed here. 

20 Such information also can be accessed as part of the identification of user preferences, and 
matching with other users and their preferences, as will be discussed in greater detail herein. 
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As a user visits various sites during browsing sessions, relevant information is 
collected at the client side and transmitted to the central server 120, where it may be stored in 
appropriate database records associated with the user, the URL or site itself, or both, as 
discussed below. 

5 In one embodiment of such an information collection strategy, monitoring is done via 

the information terminal 180 using browser software or other Internet tools. The information 
terminal 180 is connected to various information sources, such as TV station 196, radio 
station 197, and MP3 station 199 in Fig. 1. The information terminal 180 in turn 
communicates with, or in some circumstances controls, a multimedia terminal 185. Such 

10 multimedia terminals may include a radio, a television, a telephone, a PCS, a PDA, a video 
recorder, a satellite or cable television receiver or set top decoder box, or other 
microprocessor based device capable of receiving and transmitting data. In this embodiment, 
the foregoing and various other types of multimedia terminals may be suitably connected, 
such as via wire, cable, power line, or wireless networking technology which is known in the 

15 art, to an Internet capable terminal such as information terminal 180 equipped with browser 
software modified with the system's client side program code 140. Through appropriate 
hardware connections and software communications, the system may be adapted to monitor 
activity conducted on these other terminal types. Jini (tm) code is well suited to enable links 
to activate appropriate devices. It is well within the abilities of the ordinarily skilled artisan 

20 to write appropriate implementing programs using Jini (tm). There also are other techniques, 
such as the I-mode of Nippon Telephone and Telegraph (NTT), in accordance with which a 
user can turn a television or radio on or off via signals from the user's cellular telephone. 
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For example, the program code 140 may include suitable routines for communicating 
with a network interface device 182, such as an infrared or radio frequency transceiver, a 
power line network adapter card, or other conventional network hardware known in the art, 
for interacting with a multimedia terminal 185. The program code 140 may receive and send 

5 data so as to monitor activity at one or more of the information sources 196-199. 
Information in the form of implicit data, such as television channels or radio stations to 
which the TV set or radio (as examples of a multimedia terminal 185) may be tuned, program 
titles and themes such as are provided for on screen display by cable or satellite 
programming providers, telephone numbers dialed, duration of activity, and the like may be 

10 transmitted from the multimedia terminals 185 through the network interface device 182 to 
the program code 140 at the information terminal 180. As with HTML data, the program 
code 140 may subsequently transmit the multimedia data collected in this manner to the 
central server 120 for classification, recordation, and aggregation. A user may wish to 
deactivate this implicit multimedia data monitoring function. 

15 Software control of the network interface device 182 through program code 140 

enables not only client side implicit data monitoring of user activity at one or more of the 
multimedia terminals 185, but also facilitates control of such multimedia terminals 185 from 
the information terminal 180. If a user selects an HTML link representing a television 
program, for example, the browser software in information terminal 180 may communicate 

20 that selection to the program code 140, which, in turn, may send data commands through the 
network interface device 182, activating a television set or other multimedia terminal 185 
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connected to the system and tuning the receiver to the appropriate channel, such that the 
television program is displayed on the television. 

It should be noted that, while for ease of description network interface device 182 and 
multimedia terminal 185 are shown as separate from information terminal 180, it is within 
5 the contemplation of the invention to incorporate one or more types of multimedia terminal 
185 into information terminal 180, as well as appropriate network interface devices 182. 

Figure 2 is a simplified diagrammatic view of one embodiment of a distributed 
monitoring process employed by the present invention. The central server 220 in Fig. 2 
represents the same central server 120 discussed above with reference to Fig. 1; additionally, 
10 the database 225 maintained at the central server 220 is further represented in the diagram. 
As noted above, user activity is monitored on the client side, as opposed to the server side; 
this client side monitoring scheme is distributed, or open, to the extent that the overall 
functionality of the monitoring system is not dependent upon the capabilities of individual 
sites or servers, nor is the utility of the information collected limited to a particular site. 

15 Looking more closely at Fig. 2, for example, a plurality of Web sites are indicated as 

reference numerals 293-299; a plurality of users' Internet capable computer terminals, each 
of which may be at a different remote location, are designated as reference numerals 210- 
214. As indicated by the solid arrows in Fig. 2, a user at a remote terminal 210 may visit 
Web sites 293, 295, 297, and 299. One advantage of a distributed monitoring system as 

20 illustrated in Fig. 2 is that information concerning each visited Web site 293, 295, 297, and 
299 may be transmitted, via the program code discussed above, from the remote terminal 210 
to the central server 220, irrespective of whether or not local monitoring is supported at the 
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Web sites 293-299 themselves; this transmission of data to the central server 220 is 
represented by the dashed arrows in Fig. 2. As discussed above, transmission of both 
implicit data, representing monitored user activity, and explicit data, representing user 
opinion, comment, recommendation, voting behavior, and the like, is supported. 

5 The client side data monitored at the remote terminal 210 may be categorized and 

sorted by program code at the central server 220 and stored accordingly in the database 225. 
It will be appreciated that with increasing numbers of remote terminals 210-214 actively 
contributing to the distributed monitoring system, increasing amounts of information may be 
accumulated in the database 225 and may subsequently be available for retrieval. 

10 Whereas the utility of the knowledge base accumulated on the server side at a specific 

site or company is limited to the server, site, or company where the data is monitored and 
collected, the strategy of monitoring user activity on the client side as the user browses from 
site to site enables the system illustrated in Fig. 2 to collect data from a multiplicity of 
sources and to implement accumulated data in locations other than where it was originally 

15 collected. The result is an open knowledge base, resulting from the inventive distributed 
monitoring system which monitors at the client side, rather than the server side. 

Figure 3 is a simplified diagrammatic view of one embodiment of a data aggregation 
process employed by the present invention which may use a distributed monitoring scheme, 
such as the one illustrated in Fig. 2, for data acquisition from multiple remote sources. In 
20 contrast to the situation illustrated in Fig. 2, all of the remote terminals 310-314 are actively 
engaged in some sort of network activity, and the program code installed in each respective 
terminal 310-314 is monitoring the activity on the client side, irrespective of any local server 
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side monitoring occurring at the respective Web sites 393-399. As indicated by the dashed 
arrows in Fig. 3, each remote terminal 310-314 may transmit, via the program code installed 
at each respective terminal, both implicit data concerning each of the Web sites 393-399 
visited by a particular user, as well as explicit data concerning each respective user's 
5 feedback, comments, recommendations, voting behavior, and the like. 

By way of contrast with Fig. 2, the situation illustrated in Fig. 3 involves all of the 
remote terminals 310-314 gathering data for recordation in the database 325. Accordingly, 
data which are ultimately categorized and stored in the database 325 are correspondingly 
more numerous. The central server 320 may be designed such that the database 325 has 
10 sufficient capacity for the number of users employing the system, and may be scalable to 
accommodate more traffic as the number of users expands. 

Turning now to the structure of the database 325 itself, it is important to note that 
each respective user of the system may have an associated database record at the central 
server 320. In any given user record, a wide variety of information concerning that user's 

15 preferences and habits, i.e. a user profile, may be maintained. This recorded data may be 
accessed and utilized by the system to personalize each respective user's interactive 
experience, for example, through customized ranking of search results or through targeted 
site recommendations. Additionally, user profile data, and in particular, explicit data input 
by a user, may be made available to other users, such that users, as well as the knowledge 

20 and recommendations each has to offer, are recognized by the system as searchable 
information. The previously-mentioned open knowledge base thus constitutes the searchable 
information to which the system has access. 
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Further, statistical data may additionally be stored in the database 325. This 
statistical data may be considered universal, or "global," to the extent that it is not necessarily 
related to a single user's profile. Rather, statistical data may be related to the URLs most 
frequently visited by all the users on the system, keywords associated therewith, recent topics 

5 of interest, and the like. As will be appreciated from examination of Fig. 3, the system and 
method of the present invention are adapted to aggregate much more information than each 
of the individual Web sites 393-399, for example. Additionally, the explicit data input by 
each respective user are not merely limited to HTML technology, though such data may be 
accessible through HTML links. For example, a user may provide information related to 

10 television, radio, or video broadcasts, for example, which are not in HTML format; as a 
result, statistical data may be maintained independent of the HTML documents which are 
ordinarily searchable on the World Wide Web. 

Figure 4 is a simplified diagrammatic view of one embodiment of an open 
recommendation system and method of the present invention. As noted above, in addition to 
15 accumulating records in a database 425 maintained at a central server 420, the system and 
method of the present invention are adapted to implement recorded data in such a way as to 
customize otherwise impersonal aspects and features of Internet activity which are currently 
supported. 

In contrast with the data collection and aggregation situations depicted in Figs. 2 and 
20 3, respectively, Fig. 4 illustrates that the direction of data flow is reversed during the data 
implementation, or open recommendation, process. Program code 421 maintained at the 
central server 420 may be adapted to communicate with various remote terminals such as 



those denoted by reference numerals 410-414. Responsive to a request from the program 
code installed at a remote terminal, such as terminal 412, the program code 421 may extract 
data from the database 425 on demand, manipulate the data according to the nature of the 
operation requested by the remote terminal 412, and subsequently transmit results of any data 
5 manipulation to the terminal 412. 

As a result of the data manipulation, terminal 412 then may communicate with other 
terminals, such as terminal 410 or 413. Terminal 412 also may access Web sites, such as 
Web site 495 or 499. As discussed above with reference to Fig. 1, in response to information 
received from the program code 421 at the central server 420, the program code installed in 
10 terminal 412 may also communicate with a network interface device, which may operate to 
control a multimedia terminal such as a television, video or audio receiver or recording 
apparatus, and the like. This further communication is directed to assisting the user of 
terminal 412 in retrieving data, broadcasts, or other information consistent with the 
information in that user's profile. 

15 In operation, the program code 421 serves to interface each remote terminal 410-414 

connected to the central server 420 with the functionality of the system which, in turn, is 
enabled by the accumulation of the data records in the database 425. As noted above, the 
program code 421 may ultimately be responsive to a request from the program code installed 
at any of the remote terminals connected to the central server 420; accordingly, the functions 

20 performed by the program code 421 vary as described below. 

Figure 5 is a general diagrammatic overview of the functionality of the system and 
method of the present invention. The central server is designated as reference numeral 520 in 
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the center of the figure, and may include database records for HTML data and multimedia 
data, represented by reference numerals 522 and 523, respectively. The items illustrated on 
the left side of Fig. 5 represent, by way of example, the nature of implicit data and explicit 
data which may be collected and aggregated as discussed above with reference to Figs. 1 
5 through 3. As illustrated in Fig. 5, data monitored at the client side may be aggregated, 
analyzed, and organized at the central server 520, and subsequently stored in appropriate 
records according to user profiles, statistical records, HTML data, multimedia data, and the 
like. 

Matching engine 550 accepts input from HTML database 522 and multimedia 
10 database 523, as well as from central server 520, and categorizes user profiles and tastes as 
follows. A number of categories (for example, 100) may be created. For each category, 
there may be a set of words (for example, as many as 100) related thereto. For example, for 
"fashion," there might be words such as size, color, style, brand, gender, and the like that 
would be associated therewith. For "sports," words such as baseball, football, golf, hockey, 
1 5 and the like may be associated therewith. 

Identification of appropriate key words for given categories may be carried out in 
various ways. One way would be to obtain, beforehand, a number of pages which would be 
relevant to a given category, and conduct lexical analysis to identify non-generic words that 
occur with some predetermined frequency within those pages. Those non-generic words then 
20 would be part of the overall list of key words. 

By identifying categories and words associated with each category, there may be a 
concordance of, for example, 10,000 key words, occurrences of which the client side 
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software will monitor as a user moves to various Web pages. There may be weighting 
associated with different key words, such that certain words, such as "baseball" may receive 
a different weighting within a sports category than something slightly more specific, such as 
"pitcher" or "catcher". This weighting may be static, i.e. predetermined, or may change 
5 depending on various considerations. 

The weighting may be determined based on a list of user preferences provided 
beforehand. For example, if a user identifies sports as an area of interest, instances of the 
word "baseball" in Web pages that the user accesses may be given a higher weighting than in 
the case of a user who did not identify sports as an area of interest. As a specific example, 

10 someone interested in movie stars may access Web pages that talk about Marilyn Monroe, an 
actress who was married to Joe DiMaggio, a baseball player. The Marilyn Monroe reference 
may be weighted more heavily for that user than the reference to DiMaggio. On the other 
hand, a user may be interested in sports, and may access Web pages that talk about Joe 
DiMaggio, who was married to Marilyn Monroe. For that user, the DiMaggio reference may 

15 be weighted more heavily than the refernce to Marilyn Monroe. 

As the client side software identifies and counts up occurrences of ones of these key 
words, the database at the server side is updated for that particular user, and in this way, that 
user's preferences and desires are categorized to an ever-increasing level of detail. 

20 With the user preferences thus identified through key word occurrences in accessed 

Web pages, it is possible to identify vectors, as it were, of user preferences. Those vectors, 
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which can be weighted appropriately, then can be matched with vectors of other users, to 
identify users with similar interests. 

Program code at the central server, such as the program code 421 discussed above 
5 with reference to Fig. 4, supports the recommendation functionality depicted on the right side 
of Fig. 5. While many ranking or sorting engines are known in the art, for example, current 
technology does not factor user profile data into the ranking procedure. It will be appreciated 
by those of skill in the art that programming scripts may compare user profile data with 
statistical data or information parsed directly from HTML code, for example, in order to 
10 assign a relative measure of compatibility to a particular site with respect to a particular user. 
Similarly, programming scripts may compare the respective data stored in two users' 
respective profiles and subsequently compute a relative measure of the compatibility of the 
users themselves. 

In operation, user profile information, such as most commonly visited URL, most 
15 frequently submitted keyword, and the like, may be employed by the program code in the 
recommendation and ranking engines illustrated in Fig. 5. Output from the various 
recommendation and ranking engines may be transmitted to the program code installed at a 
remote terminal, which in turn, provides a display for a user. The display may be interfaced 
with the GUI provided by standard browser software code, as discussed above with reference 
20 to Fig. 1. 

Regarding the exemplary capabilities of the system, designated generally by reference 

numerals 524-529 depicted on the right side of Fig. 5, the program code at both the central 
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server and the user's remote terminal can provide many advanced features in a single, 
comprehensive system. For example, the present invention may provide access to a BBS- 
type service, such as illustrated at 529, enabling users to exchange text-based comments on 
various topics of interest. The system and method may additionally provide access to 
5 multimedia formats which cannot be displayed by HTML based browsers, such as television 
or radio broadcasting, for example. Hyperlinks displayed on the HTML page may be 
employed to link the user directly to a particular television program or other live broadcast. 
Selecting such a link displayed on the HTML page may activate one or more software 
programming scripts designed to interact with the other media beyond the scope of HTML. 

10 Conveniently, for example, the system and method of the present invention may be 

adapted to provide an advanced search engine ranking and filtering function which takes 
advantage of monitored data and user voting history recorded at the central server for all 
users of the system. Such a ranking engine is designated as reference numeral 524. The 
conventional search engine is driven by predetermined algorithms based upon keyword 

15 frequency or metatags, and consequently provides search results, or "hits," which are based 
upon neither a measure of the real relevance of the various sites with respect to a given 
keyword nor users' evaluations of the information provided at those sites. The present 
system and method, on the other hand, enable more meaningful search results by utilizing the 
implicit and explicit data aggregated in the central database to refine searches. 

20 According to one embodiment, statistical data concerning, for example, the number of 

users who have provided positive feedback and the keywords each user most frequently 
associates with a particular site, may be employed by the ranking and filtering engine 524; 



authoritative, popular, and relevant sites may be identified through such filtering, and ranked 
accordingly. Importantly, the information collected concerning sites is not limited to portal- 
type technologies, which favor information offered by licensed content providers. Since data 
is collected at the client side and aggregated at a central server, the information available to 
5 be recorded and subsequently implemented is not selectively limited by the operator of a 
portal site or other information content provider, but rather extends to any and every site 
which may be reachable from the client side. 

Using explicit and implicit monitored data to evaluate each URL which is a potential 
search result eliminates the likelihood of dead links being returned as hits, since users are 
10 unlikely to provide a vote of approval for such sites, and consequently, fewer users are likely 
to visit such a site. By compiling data from registered users in the database within the central 
server 520, search results for any given query will improve over time; as more data is added 
to the database, dead links and irrelevant sites may be eliminated and the most relevant sites 
may be identified more easily. The improving results are in contrast to known search 
15 engines, which may provide more results as Web pages are added, but not better or otherwise 
different results. 

Similarly, a personal or customized search engine 525 may be based upon privately 
monitored user profile data concerning each respective user of the system. Individual data, 
extracted from a specific user profile, for example, may be analyzed and compared with the 
20 foregoing statistical data for each user conducting a search. The system, therefore, may 
provide customized search results by utilizing user profile information collected for each 
respective individual and comparing that user profile data with the statistical data concerning 
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a given potential search result. Each prospective search result may be weighted or ranked, 
for example, at least partially as a function of the comparison with the user profile data. 

In the situation where two users conduct a search with the query term or keyword 
"automobile," for example, the two users may not have the same types of automobiles in 

5 mind. The first user may favor sports cars, and thus there may be data in the first user's 
profile concerning auto racing, driving schools, and a sports car driver's club; the second 
user, on the other hand, may favor sport utility vehicles, as evidenced by records in the 
second user's profile concerning 4-wheel drive vehicles and maps of the best places to drive 
a sport utility vehicle off-road. Given the identical keyword "automobile," the first user's 

10 search results may have sports car related hits ranked with higher relative priority or weight, 
while the second user's search results may have sport utility vehicle related hits ranked with 
higher relative priority. Such a search strategy reflects each person's individual profile data, 
and may provide the search results in order of decreasing relevance with respect to each 
searching user's unique profile data, for example. 

15 As alluded to earlier, search results will improve as the user's profile becomes more 

comprehensive, and as more data is added to the database. For example, if the sports car 
enthusiast's preference for Ferraris becomes part of the profile, search results will tend to 
focus on those particular sports cars, or very expensive cars, or Italian sports cars. 

As noted above, a multimedia search engine 526, utilizes and accesses various kinds 
20 of information provided via user or human input. Accordingly, search results, for example, 
may include itemized lists or information concerning people, television, video, MP3, and 
others, in addition to ordinary URL results. Where a data record exists in the database, for 
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example, associating a television program's theme with a particular keyword, a search of that 
keyword will return any URLs which may be relevant as well as the television program 
record. Since human knowledge is incorporated into the system and catalogued in the central 
database 520, information outside the realm of HTML, such as television, radio, MP3, and so 
5 forth, may be recorded in the database 520 where implicit monitoring of a multimedia 
terminal has been conducted as discussed with reference to Fig. 1, or where one or more 
users have provided explicit data concerning a multimedia topic. The database 520 
constitutes the open knowledge base which has been discussed above. 

The references to these other media may be expressed in HTML, and links may be 
10 provided for access thereto. Through selection of a link representing a television program, 
for example, the corresponding television program may be displayed on a television 
connected to the system. Universal remote controls, for example, and other types of wireless 
or wire based networking devices or adapter cards, currently enable a single device to control 
various other terminals via infrared or radio frequency transmissions or other communication 
15 techniques. Where such a device is operatively coupled to a user's computer terminal and 
software controlled by the program code embedded in the user's Internet browser software, 
an HTML hyperlink may be used, for example, to tune a digital radio receiver, to program a 
video cassette recorder to record a particular television channel at a particular time, and so 
forth. 

20 Further, since the system has access to explicit data input by users, various 

associations may be made, depending upon the nature of explicit data recorded. If a user 
were to conduct a search using the title of a television program as keyword query, for 
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example, the search result may yield the names of people who share preferences and habits 
which are consistent with the theme of the television program. By following a link from one 
of the people returned as a search result, which link may be shown on the HTML page, the 
television program may be viewed on a television connected to the system. Again, as the 
5 wealth of information recorded in the database becomes more comprehensive, the more 
capable the system will become with respect to associating various users' profiles with data 
concerning multimedia topics. 

The program code embodied in the present invention further may utilize both explicit 
user input and implicit user data, derived from user behavior, to categorize preferences and 

10 biases of an individual user and to store monitored information in a central database. 
Importantly, these preferences and biases may subsequently be employed by the open 
recommendation system and method to personalize the process of information retrieval and 
to assist the user in making decisions concerning information which may be most desirable 
for a given user. A targeted site referral system or recommendation engine, for example, 

15 may suggest particular URLs based upon known user tendencies and statistical data related to 
various sites. 

Since a vast array of data is stored at the central database concerning HTML pages, 
for example, the system may employ the foregoing program code to recommend certain sites 
which are compatible with an individual user's profile. Such a site recommendation engine 
20 is designated as reference numeral 527. Where the system makes recommendations or 
suggestions based upon information known about the various sites recorded in the database, 
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as well as upon explicit and implicit data concerning the user's preferences, information 
location and retrieval may be more efficient for the user. 

Again, since the present invention utilizes a distributed monitoring system which 
collects data at the client side, the site recommendation engine 527 is not limited in scope to 
5 the few sites favored by a particular portal operator or accessible through conventional search 
engines, for example. The present site recommendation engine 527 may identify and suggest 
good content or compatible sites irrespective of location or domain; so long as the site is 
accessible from the client side, information concerning the site may be maintained in the 
database, and accordingly, the system may recommend the site to potentially interested users. 

10 In addition to Web sites or other information sources which are likely to be of interest 

to a particular user, for example, the program code of the present system and method may be 
adapted to offer recommended or suggested people, as indicated at reference numeral 528, 
with whom a given user may be interested in communicating. Through efficient utilization 
of the implicit and explicit data collected for each user, the present invention allows one user 

15 to find other people to contact who have similar interests, as reflected in their respective user 
profiles. The program code may analyze user profiles, for example, and recognize which 
users have corresponding or similar interests, based upon implicit and explicit monitored 
data, thereby facilitating communication between users, and encouraging information 
exchange on a large scale. 

20 Similar to the site recommendation engine 527 discussed above, the person 

recommendation engine 528 is not limited to the closed group of registered members of a 
particular newsgroup, chat room, or BBS, but rather may extend to the entire universe of 



people who have a profile on record in the system. Since detailed records, based upon data 
monitored on the client side, are maintained at the central server 520 concerning every user's 
profile, the system and method of the present invention may provide a user with a list of 
recommended or compatible people, notwithstanding the fact that they are not registered 
members of the same newsgroup as the user, for example. 

Additionally, the system and method of the present invention are especially well 
adapted for indicating similarity or discrepancy between data contained in a particular user 
profile record and data related to various sites or to other users. For example, matching 
degree meters, or compatibility gauges, may be provided for displaying the degree of 
similarity between the user and a particular URL, or between the user and other members or 
users of the system. A graphical display or numerical indication may be provided to the user 
with respect to the degree of "matching" or similarity. 

It will be appreciated that the degree of similarity between a user and a given site, for 
example, may change as a result of various factors: the content of a particular site may vary 
over time, for instance, or the information in the user's profile may change during ordinary 
and increasing use of the system. Additional matching information advantageously may 
include a measure of the relative compatibility of one user with respect to the other people 
making use of the system. Given an indication of the relative matching of information in 
various users' profiles, one user may select the link to the user having the most closely 
matched profile rather than selecting a different user whose preferences appear to be 
relatively incompatible. 
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Figure 6 is a representation of the GUI employed by standard Web browser software, 
such as browser software 180 discussed above with reference to Fig. 1, which has been 
modified in accordance with the present invention. As discussed briefly above, the program 
code installed at a remote terminal may communicate with the central server as well as 

5 provide an interface through which the user may access the functionality of the system. The 
exemplary GUI depicted in Fig. 6 provides a convenient interface to the foregoing interactive 
functions described above with reference to Fig. 5. Upon installation at a remote server, the 
program code may take advantage of the interface mechanisms, or "hooks," in the standard 
browser code, and may install an additional frame in the GUI, such as the tool bar 641 in Fig. 

10 6. As is known in the art, the tool bar 641 or other interface frame may contain operative 
buttons, drop-down menus, text entry boxes for initiating search features, and other GUI 
devices for accepting user input, as well as advertising banners and the like. 

In one exemplary embodiment, the tool bar 641 may contain operative buttons or 
drop-down menus which provide access to explicit data entry scripts. Through selection of 
15 an item from a menu, for instance, the user may be presented with a text box or window into 
which comments, recommendations, and other information may be entered; additionally or 
alternatively, messages to other users may be posted in a manner similar to current 
newsgroup or BBS technology, or sent in real time. Any such explicit data provided may be 
recorded and aggregated at the central server for use by the system as described below. 

20 Further, the tool bar 641 may include operative buttons which allow voting, for 

example, enabling a user to express positive or negative reviews of a particular Web site, 
person, current event, or other topic of interest. In Fig. 6, the voting button is signified by a 
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heart in the tool bar 641 . The tool bar 641 may also provide a text box 648 for accepting user 
keyword input for the search engine functionality of the system. 

Taking user privacy concerns into consideration, the system and method of the 
present invention may be adapted selectively to be deactivated. To support such a feature, an 
5 operative button or other mechanism in the tool bar 641 may be provided so as to enable a 
user to turn off or otherwise to disable the implicit data collection functionality of the system. 
In Fig. 6, the deactivation function is signified by a button displaying the word "Off in the 
tool bar 641. 

In one embodiment, a "compatibility gauge" 647 may be provided in the tool bar 641 
10 for providing a measure of the compatibility of the user profile data relative to the statistical 
data related to the site which is currently loaded in the browser software. A sliding scale, 
graph, or other type of meter may give the user a visual or numerical indication of how 
closely matched the statistical data related to the current site is to that data stored in the 
central database concerning the user's preferences. In Fig. 6, the relatively high level shown 
15 in the compatibility gauge 647 serves as an indication that the user's profile data is relatively 
consistent with the statistical data maintained at the central database related to "Hamburger 
Web Site." As the user browses from site to site, the level indicated by the compatibility 
gauge 647 changes as each successive page or site is loaded into the browser, such that a 
display of compatibility may be provided for a user in the tool bar 641 during ordinary 
20 browsing or searching activities. 

Also, as discussed above with reference to Fig. 1, the program code incorporated into 
the system may enable the following functions in the tool bar 641: search engine features; 



banner advertising, which may be optimized or customized according to monitored user 
preferences; survey or voting functions, which facilitate explicit data input concerning Web 
sites, people, and so forth; and lotteries, slot machines, or other types of games of chance. It 
will be appreciated by those of skill in the art that incorporation of these features in the tool 
5 bar 641 may be accomplished through appropriate program scripts. 

Figures 7 and 8 are respective representations of search results ordered in accordance 
with different respective aspects of the present invention. The main browser windows in 
Figs. 7 and 8 show examples of search results which have been obtained through the ranking 
engine 524 and the personal ranking engine 525, respectively, of the present invention. A 

10 keyword query may be entered at the text entry box 749 or 849, for example, which is 
provided by the HTML of the site currently loaded in the browser; alternatively, as discussed 
above with reference to Fig. 6, search engine program scripts may be accessed through a 
keyword query entered at a text entry box 648 provided in the tool bar 641. Responsive to 
such a keyword query or other search request, program code at the central server may 

15 orchestrate or oversee a database search for the query term. 

The initial search may be conducted by program code at the central server, or it may 
be "out-sourced" to a large scale Web portal site or Web-based search engine. The results of 
such an initial search may be general, or standard, to the extent that the information retrieved 
may not be filtered or personalized for the particular user conducting the search. These 
20 initial results may subsequently be filtered through the ranking and filtering engine 524, 
either in isolation or in conjunction with the personal ranking engine 525, as discussed above. 
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The returned results may be ranked in accordance with the original, general search; 
alternatively, after filtering, for example through the ranking engine 524, the results may be 
ordered according to the number of users providing explicit data in the form of voting input 
with respect to each hit. The first three results, or hits, indicated by the bracket at the left 
5 side of the Fig. 7, are designated by reference numeral 742. In accordance with one aspect of 
the invention, the ranking is ordered in accordance with the number of votes each respective 
site has received from browsing users. Thus, the results in the illustrated example shown in 
Fig. 7 are ranked in order of voting users, with the number of voting users shown adjacent 
each respective hit and its corresponding compatibility gauge. 
10 In the illustrated example shown in Fig. 7, where the user has input a query to find 

out information about "hamburger," the largest number of users indicated approval of In 'n 
Out Burger, and therefore this results is ranked first in Fig. 7. Wendy's was selected by the 
second largest number of users, and so this result is ranked second. Therefore, a user may 
find, or be led to the conclusion that, "In 'n Out" contains more relevant information than 
15 "Wendy's." In Fig. 7, as noted above, the order of hits is determined by the number of users 
providing explicit voting data for each respective search result; these results have not been 
subjected to the personal ranking engine 525 of the present invention. 

As a consequence of relying on group voting preferences, the degrees of correlation 
between the user's preferences and the data concerning the various search results in the list 
20 have not been considered by the system prior to presentation of those results to the user. 
That this is the case can be seen in that, although "Carl's Jr." has received the fewest number 
of votes, thereby placing it at the bottom of the list of hits, the corresponding compatibility 
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gauge indicates the highest degree of relative correlation between the user's preferences and 
the information available in "Carl's Jr." Therefore, the user may choose to navigate to 
"Carl's Jr." first, notwithstanding its relatively low placement in the ranking, based upon the 
compatibility gauge. It will be appreciated from examination of Fig. 7 that the ranking 
5 engine 524 may provide a user with both information concerning each hit's general 
relevance, reflected in the number of voters and the order of the hits, as well as information 
concerning the compatibility of the user's profile data with the information available at each 
site. 

Additionally, in the case where the personal ranking engine 525 is employed, the 
10 search results may be presented in a personalized order according to a comparison of data 
concerning each hit and data contained in the user's profile. In the illustrated example shown 
in Fig. 8, where the user has input the same query to find out information about "hamburger," 
the largest number of users, again, indicated approval of In 'n Out Burger. Wendy's was 
selected by the second largest number, as in Fig. 7. In contrast to Fig. 7, however, the results 
15 in Fig. 8 have been manipulated by the personal ranking engine 525, such that user 
compatibility with the site affects the order in which the hits are returned. Though "Carl's 
Jr." received the fewest votes, the site is displayed as the first hit on the list, by virtue of its 
having the highest relative degree of compatibility with the user's profile data. 

That is, upon completion of the initial search, the system of the present invention may 
20 employ programming adapted both to filter search results to provide meaningful search 
results in general, as well as to rank or to order search results according to the data 
maintained in each user's profile. According to one embodiment of the invention, statistical 
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data related to each URL may be extracted from the database and implemented to organize 
search results according to the most popular, or most frequently visited, URLs. Such 
statistical data enables URLs to be examined in light of all the data accumulated by all of the 
users participating in the distributed monitoring system. This embodiment represents an 

5 improved, large scale search filtering engine for the masses, such as the ranking and filtering 
engine 524 in Fig. 5, exemplary results of which are displayed in Fig. 7. According to 
another embodiment of the present invention, programming code at the central server may 
extract database records both for the user conducting the search, i.e. user profile data, as well 
as for the URLs returned as results of the search, i.e. HTML data. This embodiment 

10 represents a unique, small scale, personalized search filtering engine, such as the personal 
ranking engine 525 in Fig. 5, designed to tailor search results accordingly to user preferences, 
as depicted in Fig. 8. 

A comparison of the extracted database records may provide an indication that, for 
example, a particular HTML document contains, or may be related to, information which the 

15 user finds especially interesting. A high degree of correlation between extracted records may 
be interpreted by the system as compatibility between the user's interests and the site's 
contents; conversely, a low degree of correlation, or a discrepancy, between the user's 
preferences and the contents of the site may be interpreted by the system as incompatibility. 
The programming code at the central server may be adapted to perform such a compatibility 

20 analysis with respect to every URL loaded into the user's browser or returned as a hit in a 
search. 
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As noted above, the results of such a personalization of the rank or order of search 
results may be displayed to the user in a variety of ways. In Figs. 7 and 8, each respective hit 
is displayed adjacent a respective compatibility gauge, such as the meter 743 or 843, for 
example, and a numerical indication of the number of users who have input votes concerning 
5 the hit. The levels indicated by the respective meters serve as a graphical display of the 
correlation of data in the user's profile and data concerning the corresponding search result as 
calculated by the personal ranking engine through the compatibility analysis discussed above, 
for example. 

Alternatively, the program code at the server may be adapted to re-order the search 
10 results, such that those URLs which are most compatible with the user's profile data may be 
displayed higher in the list, hi such an embodiment wherein the rank or order reflects 
compatibility with the user's preferences, a compatibility gauge, such as the meter 843, may 
not be required or desired, since the relative degree of compatibility with the user's profile 
data will be factored into the determination of the order in which the URLs are presented to 
15 the user, as in Fig. 8. Though relative degrees of matching are reflected in the rank of the 
hits in such an embodiment, compatibility gauges may be provided nevertheless, for 
displaying to the user an indication of the absolute degree of correlation between the user 
profile data and the HTML data. 

As an example, a list of users, indicated by the bracket on the right side of Figs. 7 and 
20 8, and designated by reference numerals 744 and 844 respectively, has been ranked in 
decreasing order of compatibility with respect to the user conducting a search. The ranking 
may be determined by a similar programming procedure as the compatibility analysis 
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discussed above with reference to HTML data; in this case, however, user profiles may be 
extracted from the database and examined for similarities and differences. The degree of 
correlation between user profile data may be interpreted by the programming code to relate 
directly to the compatibility of the two users whose data profiles are compared. 

5 In this embodiment, though the order of the list of users 744 or 844 is indicative of 

relative compatibility with the user conducting a search, compatibility gauges, such as the 
meter 745 or 845, may be provided nevertheless, in order to provide the user conducting a 
search with an indication of absolute compatibility with respect to each other user who has 
previously commented on a particular site. In this way, the user conducting a search may 

10 make a judgment concerning the weight to be given to a particular vote, comment, criticism, 
or recommendation. The compatibility gauges thus may suggest to a user that linking to 
certain individuals (here, "NAMI") and accessing information via their name cards, etc., 
might be most productive. 

As indicated by the tree structure in Figs. 7 and 8, the list of users 744 or 844 may be 
15 an inclusive roster of people who have provided feedback, comments, or other explicit data 
related to the third hit in the list of URLs 742 or 842. Similar lists of users may be available 
for each hit returned in the search results. Such lists of users providing explicit data related 
to a particular URL may be selectively viewable. The system may be adapted to provide 
instant access to the explicit data provided by each respective listed user concerning the site 
20 in question; a user conducting a search may, therefore, obtain important information in the 
form of explicit user comments, votes, reviews, and advice, before navigating to a particular 
site. Importantly, this information has been provided by people who have previously visited 
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the site, and who have strong enough opinions about the site to input thoughts and comments 
explicitly. 

In addition to the meter 745 or 845 which serves as a visual indication of each listed 
user's compatibility with the searcher's profile, an operative button icon, such as that 

5 designated by reference numerals 746 and 846, may be associated with each listed user. 
Selection of the appropriate operative button icon 746 or 846 may provide the searcher with 
access to information related to the respective listed user. The program code at the user's 
terminal may invoke an additional window, for example, in which information concerning 
the listed user's e-mail address, profile data, favorite sites, comments, voting history, 

10 recommendations, and other explicit data may be available for review. Hyperlinks may be 
provided for convenient navigation to an e-mail server, for instance, or to recommended or 
highly favored URLs or multimedia functions. 

Returning, now to the tool bar 641 and the contents thereof, it will be appreciated by 
those skilled in the art that various GUI programming methods may be employed to enable 

15 users to access a wide variety of system capabilities, such as those depicted on the right side 
of Fig. 5. Access to one or more BBS or newsgroup systems may be provided in the tool bar 
641, for example, in the form of drop-down menus or operative buttons. By selecting such a 
service, users may gain access to text-based message windows, dialogue boxes, chat rooms 
for real-time communication, or the like. Users may post or transmit explicit data concerning 

20 any topic of interest as well as view other users' input. This functionality is supported by 
program code both at the central server as well as at the users' respective remote terminals. 



-46- 



Additionally, the tool bar 641 may provide an interactive element for invoking 
programming scripts which allow a user to offer explicit comments and feedback, or to vote 
positively or negatively, concerning the particular URL or site which is currently loaded in 
the user's browser. Such input may preferably be available to other users in the manner 
described above with reference to the list of users 744 or 844 who have previously visited a 
site. In this embodiment, selection of an operative button, for example, may invoke a text 
window for entering text comments. Such explicit data may be transmitted to the central 
server for recordation in a data record associated with the particular site; additionally, user 
profile data may be associated with the data record to enable the tree structure illustrated in 
Figs. 7 and 8. 

A personal recommendation function may also be accessed through the tool bar 641 
shown in Fig. 6, for example. As noted above with respect to the compatibility analysis, data 
records in the database at the central server may be extracted for analysis and comparison 
with other records. In the same way that compatibility ratings may be computed during 
ordinary user activity, a user may independently invoke a separate compatibility engine, such 
as recommendation engines 527 and 528 in Fig. 5, adapted for providing recommendations 
with respect to Web sites, people, places, and so forth. 

The program code at the user's terminal may accommodate a request for a list of 
other users who share an interest in a particular topic, for example. The programming scripts 
at the central server may analyze user profile records and provide a list of compatible users, 
or suggest a particularly popular Web site or newsgroup frequented by other users with 
similar interests. Similarly, the central server may employ code for recommending popular 
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or authoritative Web sites related to a specific topic and compatible with the user's profile 
data; in conjunction with the suggested sites, the system may additionally provide a list of 
other users who favor the sites, along with their respective explicit comments. 

Importantly, the system and method of the present invention are designed such that 
5 users' responses and reactions to the recommendation system may be monitored, and 
feedback may be encouraged; such monitoring results in the cyclic collection and 
aggregation of more and better implicit and explicit data. As a consequence, the system may 
develop an increasingly accurate and more complete profile of each user based upon implicit 
data monitored at the client side; additionally, the wealth of general knowledge and 
10 information accumulating in the records of explicit data may expand to provide increasingly 
more detailed and more useful reviews and recommendations concerning an expanding array 
of topics and people. 

Figure 9 is a simplified diagrammatic view of one embodiment of the distributed 
monitoring system of the present invention. As noted above with reference to Fig. 1, typical 

15 Web browser software 980 is customarily adapted to be modified by additional software, 
such as program code 940, which may both alter the GUI presented to the user by the 
browser software 980 as well as provide the foregoing additional functionality. In particular, 
as depicted in Fig. 9, the program code 940 interfaces with the browser software 980 and the 
operating system 990 of the remote terminal in order to enable the client side monitoring of 

20 user browsing activity discussed above. 

In operation, the program code 940 at the user's terminal may be adapted to accept 
data input directly from the user, and also may monitor the activity of the browser software 
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980 through the appropriate code in the browser software 980, such as the COM interface 
981. The COM interface 981 is a "hook" which permits a third party programmer to insert 
code which operates in conjunction with browser software 980. 

Data from the browser software 980 which may be collected includes the following: 
the URL currently loaded; the location of the browser window in the remote terminal's 
display; and various browsing events such as GUI device selections, download completions, 
and URL requests. Through interface with the operating system 990 of the remote terminal, 
the program code 940 may be apprised of the duration of the visit to a particular URL. After 
the appropriate data is collected at the client side, the program code 940 may transmit 
collected data to the central server for recordation, categorization, and aggregation with data 
collected from other users, as discussed above. 

From the foregoing, it can be seen that the system and method of the present 
invention provide versatile and personalized information retrieval functionality which 
addresses the nature of the search space and which is adapted for efficient user interaction. 
The preferred embodiments disclosed have been described and illustrated by way of example 
only, and not by way of limitation. Other modifications and variations to the invention will 
be apparent to those skilled in the art from the foregoing detailed disclosure. Thus, while 
only certain embodiments of the invention have been specifically described herein, it will be 
apparent that numerous modifications may be made thereto without departing from the spirit 
and scope of the invention. 
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